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a class-ll restriction enzymes. According to this inven- 
tion, it is possible to analyse and diagnose variations 
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paring the analyzed pattern of genes expressed in a cell 
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genes. This method is also applicable to the search and 
isolation of genes of physiologically active substances 
that are potential pharmaceuticals or causative genes of 
hereditary diseases, as well as the isolation of those 
genes that are useful for improving agricultural prod- 
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Description 

FIELD OF THE INVENTION 

This invention relates to a method for molecular indexing which is applicable to the analysis and diagnosis of dis- 
eases such as cancers, the search and isolation of genes of physiologically active substances that are potential phar- 
maceuticals or causative genes of hereditary diseases, as well as the isolation of those genes that are useful for 
improving agricultural products. 

BACKGROUND OF THE INVENTION 

For examining differences in gene expression between two tissues, there has been described a method wherein a 
portion (about 50-200 genes) of the expressed gene population is amplified by PCR (the polymerase chain reaction 
method) using any short primers and then separated by polyacrylamide gel electrophoresis [P. Liang and A. B. Pardee, 
Differential display of eukaryotic messenger RNA by means of the polymerase chain reaction., Science 257: 967-971 
(1 992)]. However, in such differential display by means of PCR, only a portion of the whole gene population is amplified 
in principle and yet a plurality of bands are generated from the same gene. Furthermore, such display involves a large 
quantity of artifacts and thus is technically incomplete. Therefore, such cH-^lay only shows differences in gene expres- 
sion between two tissues which are not remote from each other or differences in gene expression in cells. Such differ- 
ential display has a problem that it cannot record the expression of individual genes. 

It is also possible to analyze variations in tissues or cells by determining the level of a particular gene in such tis- 
sues or cells through measuring the amount of its mRNA by Northern blot hybridization method. However, this method 
is not applicable when the target gene is not cloned or the base sequence thereof is unknown. In addition,' this method 
is not suitable for the analysis of a large number of genes. For example, since genes being expressed in a certain cell 
are considered about 10,000 species, it will take for about two years even if Northern blot hybridization is performed for 
100 genes per week. Thus, this method is not practically useful. 

On the other hand, those restriction enzymes which belong to class IIS (hereinafter, referred to as "class-US restric- 
tion enzymes") are restriction enzymes having an ability to cut at a precise distance outside their recognition sites. 
Those fragments cut by a class-IIS restriction enzyme are characterized to have non-identical, cohesive ends consist- 
ing of several nucleotides. There have been known more than 30 class-IIS restriction enzymes including Fok I, Bsm Fl, 
Bsm Al, Bbv I, Sfa Nl and Hga I. It is estimated that genes which have at least one cleavage site of Fok I, Bsm Fl or 
Bsm Al will be 97% of total genes. Brenner et al. has introduced a method of preparing a more detailed genome map 
using a class-IIS restriction enzyme which generates 4-nt (nucleotide) sequences in place of conventional restriction 
enzymes [S. Brenner and K. J. Livak, DNA fingerprinting by sampled sequencing, Proc. Natl. Acad. Sci. U.S.A., 
86:8902-6 (1989)]. There is also disclosed a method wherein a part of restriction enzyme fragments derived from a 
phage or cosmid is amplified by using those adaptors which are complementary to all possible 4-nt cohesive ends gen- 
erated by class-IIS restriction enzymes [D. R. Smith, ligation-mediated PCR or restriction fragments from large DNA 
molecules, PCR Methods Appl. 2:21-27 (1992); Unrau, P. and Deugau, K.V, Gene, 145, 163-169 (1994)]. However, 
though all of these methods employ class-IIS restriction enzymes and use the 4-nt overhangs generated by them as 
means for structural analysis of genomes, unlike the present invention, they do not aim at recording the expression of 
genes in a specific tissue or cell. 

In the Human Genome Project, there is vigorously argued an approach to take a tissue-derived cDNA fragment as 
a sample and to determine a partial sequence thereof as well as its location in a chromosome. In conventional methods, 
cDNA fragments are randomly taken from cDNA library. Accordingly, it is impossible to avoid a repeated sampling of the 
same fragment and there is a tendency that highly expressed fragments are selectively taken. 

It is an object of the present invention to provide a method which can analyze the state of expression of genes or 
deletion due to some abnormalities in a tissue or a cell in a short period and yet easily for a large quantity of genes. 

It is a further object of the present invention to provide a method which is applicable to a rapid isolation of the coding 
region of a protein as well as an amplification of restriction fragments of cloned DNA or genomic DNA. 

SUMMARY OF THE INVENTION 

The present inventor has made extensive and intensive researches toward the solution of the above assignment 
and, as a result, found that, by using class-IIS restriction enzymes or a combination of a class-IIS restriction enzyme 
and a class-II restriction enzyme, it is possible to classify (index) cDNA or DNA into groups in a short period and without 
duplication. Thus, the present invention has been achieved. 

The present invention relates to a method for molecular indexing comprising the following steps (hereinafter 
referred to as "Method I"): 
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(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA with a first restriction 
enzyme of class-IIS, 

(2) ligating each of the resultant cDNA fragments to one from a pool of 64 biotinylated adaptors cohesive to all pos- 
sible overhangs, 

(3) digesting the resultant cDNA fragments further with a second and a third restriction enzymes of class-IIS which 
are different from the first class-IIS restriction enzyme used in (1) above to thereby obtain a first cDNA sample, 

(4) obtaining a second cDNA sample by repeating the above steps (1) to (3) wherein the second class-IIS restric- 
tion enzyme is used for the initial digestion and the first and the third class-IIS restriction enzymes are used for the 
subsequent digestion, 

(5) obtaining a third cDNA sample by repeating the above steps (1) to (3) wherein the third class-IIS restriction 
enzyme is used for the initial digestion and the first and the second class-IIS restriction enzymes are used for the 
subsequent digestion, 

(6) recovering each of the resultant ligation samples by using streptavidin-coated paramagnetic beads and then 
removing from the samples the oligonucleotide complementary to an adaptor-primer to be used in (7), 

(7) amplifying each of the resultant cDNA samples by PCR using an adaptor-primer and one of anchored oligo-dT 
primers, 

(8) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes of 
the fragments obtained. 

The present invention also relates to a method for molecular indexing comprising the following steps (hereinafter 
referred to as "Method II"): 

(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA, or DNA with a restriction 
enzyme of class-ll, 

(2) ligating each of the resultant cDNA or DNA fragments to an adaptor cohesive to ends generated by the class-ll 
restriction enzyme, 

(3) digesting the resultant cDNA or DNA fragments further with a restriction enzyme of class-IIS, 

(4) ligating each of the resultant cDNA or DNA fragments to one from a pool of 64 biotinylated adaptors cohesive 
to all possible overhangs, 

(5) recovering the resultant ligated sample by using streptavidin-coated paramagnetic beads and then removing 
from said sample the oligonucleotides complementary to adaptor-primers to be used in (6), 

(6) amplifying the resultant cDNA or DNA sample by PCR using adaptor-primers, 

(7) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes of 
the fragments obtained. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a schematic illustration for the principle of Method I. 

Fig. 2 shows structures of cDNA which has been synthesized by reverse-transcribing RNA using a mixture of 3 oli- 
gonucleotides as primers. 

Fig. 3 shows a schematic illustration for the principle of Method II. 

Fig. 4 shows an example of the polyacrylamide electrophoresis pattern of mouse liver RNA obtained by Method I. 
Fig. 5 shows an example of the polyacrylamide electrophoresis pattern of an amplified product from mouse liver 
RNA obtained by Method II. 

EFFECT OF THE INVENTION 

According to Method I, it is possible to examine the state of expression of those genes having cleavage sites of 
class-IIS restriction enzymes (97% of total genes are estimated to have such sites when Fok I, Bsm Al and Bsm Fl are 
used) in a tissue with one to two week experiment per one human subject, since a small number of DNA sub-groups 
will do for this analysis. Furthermore, according to Method I, since the number of fragments amplified from one gene is 
only one in principle, genes can be classified (indexed) into sub-groups without redundancy. Therefore, by comparing 
the analyzed pattern between normal and abnormal tissues by using Method I, it is possible to diagnose variations such 
as tumors easily, correctly and promptly. In addition, Method I is also applicable to the search and isolation of genes of 
physiologically active substances that are potential pharmaceuticals or the causative genes of hereditary diseases, as 
well as the isolation of those genes that are useful for improving agricultural products. 

On the other hand, according to Method II, the target of analysis is not limited to RNA (or cDNA reverse-transcribed 
therefrom), since oligo-dT primers for poly A are not used as primers. According to Method II, it is also possible to 
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amplify restriction fragments of cosmid DNA or genomic DNA. Therefore, Method It is applicable to the mapping of 
these DNAs. 

In addition, regions amplified by PCR is not restricted to non-coding regions and thus it is not necessary to obtain 
clones of upstream regions in order to know the primary structure of a protein. 

5 

DETAILED DESCRIPTION OF THE INVENTION 

[I] Hereinbelow, the steps, action and effects of Method I will be described with reference to Fig. 1 . 

10 (1) First, the total RNA of a cell or a tissue is converted to cDNA with a reverse-transcriptase and the resultant 
cDNA is digested with a first class-IIS restriction enzyme. 

(2) One from a pool of 64 biotinylated adaptors described below is ligated to the resultant cDNA fragments with E. 
coli DNA ligase. Each adaptor has a 4-nt 5' end overhang wherein the outermost base is a mixture of A, C, G and 
T and the inner three bases are one of ail possible sequences. (These adaptors must not be phosphorylated at their 

is 5' ends which form protruding cohesive ends.) At this point, the restriction fragments are classified into 64 sub- 
groups. 

(3) Subsequently, the cDNA fragments are further digested with a second and a third class-IIS restriction enzymes 
which are different from the first class-IIS restriction enzyme used in (1) above to thereby obtain a first cDNA sam- 
ple. 

20 A second cDNA sample is obtained by repeating the above steps (1) to (3) wherein the second class-IIS 

restriction enzyme is used for the initial digestion and the first and the third class-IIS restriction enzymes are used 
for the subsequent digestion, and also a third cDNA sample is obtained by repeating the above steps (1) to (3) 
wherein the third class-IIS restriction enzyme is used for the initial digestion and the first and the second class-IIS 
restriction enzymes are used for the subsequent digestion. 

25 (4) As a result of digestion with the 3 class-IIS restriction enzymes described above, there are produced fragments 
which have lost poly A [see Fig. 1, (i)] and fragments which still have poly A [see Fig. 1, (ii)]. Of these fragments, 
the former ones which have lost poly A will no longer be amplified in the subsequent amplification step and only the 
latter ones with poly A will be amplified. Accordingly, the latter fragments are further classified into 64 x 3 = 192 sub- 
groups at this point depending on the cleavage site nearest the poly A side (i.e., depending on the cleavage site of 

30 which of the three restriction enzymes used). 

(5) Subsequently, the ligation sample is recovered with streptavidin-coated paramagnetic beads and the cDNA 
fragments are treated with a dilute alkaline solution. By these operations, the oligonucleotide complementary to an 
adaptor-primer to be used in (6) is removed (the oligonucleotide will become an inhibitor against PCR reaction). 

(6) The resultant cDNA sample is amplified by PCR by using a combination of an adaptor-primer and one of 
35 d(T) 25 A, d(T) 25 C and d(T) 2 sG which are anchored oligo-dT primers. Depending on the base (T C or G) adjacent to 

the poly(A) tail, fragments amplified by the above three oligo-dT primers are determined. At this point, the cDNA 
fragments are further classified into 192 x 3 = 576 groups. 

(7) The amplified products are separated by denaturing polyacrylamide gel electrophoresis and the sizes of the 
fragments obtained are automatically recorded by a sequencer. 

40 

The above-described procedures are repeated with 64 adaptors, 3 class-IIS restriction enzymes and 3 anchored 
oligo-dT primers. Therefore, an RNA population is classified into 576 groups. With respect to class-IIS restriction 
enzymes, it is estimated that 97% of genes have at least one cleavage site of Fok I, Bsm Al or Bsm Fl. Accordingly, by 
using these 3 restriction enzymes in the method of the invention, it is theoretically possible to recover and present with- 

45 out redundancy almost all of one total RNA population. 

In addition, the above method (Method I) of the invention may be similarly carried out in a modified method which 
is different from the above only in the following points. In step (2) above, one from a pool of 256 biotinylated adaptors is 
used. Each adaptor of the pool has a four-nucleotide 5' end overhang wherein the sequence is one of all possible 
sequences. The second digestion with class-HS restriction ^ Byrnes described in (3) above is not carried out. 

so In this modified method, an RNA population is classified into 768 groups since 256 adaptors and 3 anchored oligo- 
dT primers are used. 

Further, in Method I, a mixture of the following oligonucleotides may be used as primers when converting the total 
RNA from a cell or tissue into cDNA with a reverse transcriptase: 
5' OH-GGATCCT 16 A-3' 
55 5'OH-CAGCTGT 16 C-3' 
5' OH-CTCGAGT 16 G-3' 

When such primers are used, there can be obtained cDNA molecules which have T, G or C adjacent to poly (A) on 
the 5' side and a 6-base sequence added to the outside (3* side) of poly (A) (see Fig. 2). 
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in this case, amplification is carried out by using any one of 5' OH-GGATCCT 16 A-3' [instead of the above anchored 
oligo-dT primer d(T) 25 A], 5' OH-CAGCTGT 16 C-3* [instead of the above d(T) 25 C] and 5' OH-CTCGAGT 16 G-3' [instead 
of the above d(T) 2 sG]. According to these procedures, analysis can be more correct because, in addition to the specif- 
icity to cDNA of only one base of the 3' end of primers, specificity to cDNA by the 6-base sequence of the 5' end of prim- 
5 ers is utilized. 

The target RNA for Method I of the invention is isolated and purified from, for example, body tissues such as hemat- 
opoietic tissues including bone marrow, peripheral blood, lymphocytes, etc. or cells in a body fluid by conventional 
methods such as the guanidine thiocyanate method and the phenol-chloroform extraction method and then incubated 
with a reverse transcriptase and deoxyribonucleotide triphosphates for reverse-transcription into cDNA, 

w With respect to the class-IIS restriction enzymes used in Method I of the invention, there is no particular limitation 
as long as the restriction enzyme forms a 5'*protruding cohesive end consisting of 4 bases. Specific examples include 
commercially available Fok I (Takara Shuzo) and Bsm Al and Bsm Fl (both manufactured by NEB). These three restric- 
tion enzymes may be used in combination for the initial digestion (with one enzyme) and the subsequent digestion (with 
two enzymes). In the modified method, one of these three enzymes may be used. 

75 In Method I of the invention, the biotinylated adaptor means the adaptor consisting of i) an oligonucleotide of 24-27 
nucleotides which forms a 4-nt 5' protruding cohesive end wherein the outermost base is a mixture of A, C, G and T, 
and inner three bases are one of all possible sequences, and ii) an oligonucleotide which is complementary to the oli- 
gonucleotide i), shorter by 4 bases and biotinylated at the 5' end. Thus, there are 64 kinds of the biotinylated adaptors. 
In the modified method, the biotinylated adaptor means the adaptor consisting of i) an oligonucleotide of 24-27 

20 nucleotides which forms a 4-nt 5' protruding cohesive end wherein the sequence is one of all possible sequences, and 
ii) an oligonucleotide which is complementary to the oligonucleotide i), shorter by 4 bases and biotinylated at the 5' end. 
Thus, there are 256 kinds of the biotinylated adaptors. 

In order to allow E. coli DNA ligase to recognize the 3 bases of a cDNA fragment adjacent to the binding site, phos- 
phorylation of the 5' ends of the above adaptors which form cohesive ends is not carried out. 

25 In Method I of the invention, one of the two primers used for PCR is an oligonucleotide having a common sequence 
with the oligonucleotide constituting the adaptor described above which is subjected to ligation to cDNA at 3* end (= 
adaptor-primer). As a marker which labels this adaptor-primer, those which are used in conventional analysis may be 
used. Specific examples include fluorescent dyes, radioactive materials and enzymes. 

In Method I of the invention, another primer used for PCR is one of three oligo-dT primers, of which 3' end base is 

so A, C or G, These primers may be synthesized by a commercial nucleic acid synthesizer. 

[II] Hereinbelow, the steps, action and effects of Method II will be described with reference to Fig. 3. 

(1) First, DNA or cDNA of a cell or tissue is digested with a class-ll restriction enzyme (EcoRI is used in Fig. 3). 
35 (2) An adaptor which is cohesive to ends generated by the class-ll enzyme is ligated to each of the DNA or cDNA 
fragments with T4 DNA ligase (the adaptor must be phosphorylated at the 5' end which form cohesive ends). 

(3) The resultant DNA or cDNA sample is further digested with a class-IIS restriction enzyme (Bsm Al is used in 
Fig. 3). 

(4) One from a pool of 64 biotinylated adaptors described below is ligated to each of the resultant cDNA or DNA 
40 fragments with E. coli DNA ligase. Each adaptor has a 4-nt 5' end overhang wherein the outermost base is a mix- 
ture of A, C, G and T, and the inner three bases are one of all possible sequences. (These adaptors must not be 
phosphorylated at their 5' ends which form cohesive ends.) At this point, the restriction fragments are classified into 
64 groups. 

(5) Subsequently, the ligation sample is recovered with streptavidin-coated paramagnetic beads and the DNA or 
45 cDNA fragments are treated with a dilute alkaline solution. By these operations, those oligonucleotides comple- 
mentary to adaptor-primers which will become inhibitors against PCR reaction are removed. 

(6) Amplification by PCR is carried out using two adaptor-primers. The one derived from the adaptor for ends gen- 
erated by the class-ll enzyme is referred to as "adaptor-primer 1 " and the other derived from the biotinylated adap- 
tors is referred to as "adaptor-primer 2". Details will be described afterwards. 

so (7) The amplified products are separated by denaturing polyacrylamide gel electrophoresis and the sizes of the 
fragments obtained are automatically recorded by a sequencer. 

By using a class-ll restriction enzyme, a class-IIS restriction enzyme and 64 biotinylated adaptors in the operations 
described above, the DNA or cDNA fragments generated by the class-ll and class-IIS restriction enzymes used can be 
55 separated and displayed. 

When cDNA which has been reverse-transcribed from RNA is used as a target of analysis of Method II, a cDNA 
sample is prepared as follows. RNA is isolated and purified from, for example, body tissues such as hematopoietic tis- 
sues including bone marrow, peripheral blood, lymphocytes, etc. or cells in a body fluid by conventional methods such 
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as the guanidine thiocyanate method and the phenol-chloroform extraction method and then incubated with a reverse 
transcriptase and deoxyribonucleotide triphosphates for reverse-transcription into cDNA. 

It is also possible to use DNA as a target of analysis of Method II. In this case, a DNA sample is prepared as follows. 
DNA isolated from, for example, body tissues such as hematopoietic tissues including bone marrow, peripheral blood, 
lymphocytes, etc. or a cell suspension in a body fluid is crushed with polytron or the like and incubated with proteinase 
K to thereby degrade proteins. Then, the reaction solution is subjected to phenol extraction and 2 volumes of ethanol is 
added to the aqueous layer for precipitation. The precipitate is treated with ribonuclease (RNase) not containing deox- 
yribonuclease (DNase) to thereby remove RNA. 

With respect to the class- II restriction enzyme used in Method II of the invention, there is no particular limitation as 
long as the enzyme recognizes a specific base sequence, cut the site specifically and generate cohesive ends. Specific 
examples include EcoRI, BamHI, Hindlll, Belli, Bglll, Sail, Xhol, Accl, Aval, Sau3A, Taql, Motl (which form S'-protruding 
cohesive ends), and Pstl, Sad, Kpnt, Haell (which form 3'-protruding ends). 

In particular, for the analysis of genomic DNA, restriction enzymes which recognize a 8-base sequence (e.g., Notl) 
are preferably used. 

With respect to the ciass-IIS restriction enzyme used in Method II of the invention, there is no particular limitation 
as long as the enzyme generates 4-base S'-protruding cohesive ends. Specific examples include commercially availa- 
ble Fok I (Takara Shuzo) and Bsm Al, Bsm Fl, SfaNI and Bbvl (all manufactured by NEB). 

It is also possible to use 2 or 3 class-IIS restriction enzymes in combination to increase the number of groups as 
described in Method I. 

In Method II of the invention, the adaptor consists of i) an oligonucleotide of 20-30 nucleotides forming a 5'- (or 3'- 
) overhang which is cohesive to ends of restriction fragments, and ii) an oligonucleotide which is complementary to the 
above oligonucleotide i) and shorter by the number of bases forming the overhang. 

The adaptor must be phosphorylated at its 5' end (which form a cohesive end) so that an adaptor oligonucleotide 
is bound to the DNA strand which is recovered with streptavidin-coated beads. 

In Method II of the invention, the biotinylated adaptor means the adaptor consisting of i) an oligonucleotide of 24- 
27 nucleotides which forms a 4-nt 5' protruding cohesive end wherein the outermost base is a mixture of A, C, G and T 
and inner three bases are one of all possible sequences, and ii) an oligonucleotide which is complementary to the oli- 
gonucleotide i), shorter by 4 bases and biotinylated at the 5' end. Thus, there are 64 kinds of the biotinylated adaptors. 

In order to allow E. coli DNA ligase to recognize the 3 bases of a cDNA fragment adjacent to the binding site, phos- 
phorylation of the 5' end of the above biotinylated adaptor which form a cohesive end is not carried out. 

In Method II, one of the primers used in PGR is an oligonucleotide having a common sequence with the oligonu- 
cleotide constituting the adaptor described above which is subjected to ligation to cDNA or DNA fragments at its 3' end 
(adaptor -primer 1) 

In Method II of the invention, another primer used for PCR is an oligonucleotide having a common sequence with 
the oligonucleotide constituting the biotinylated adaptor described above which is subjected to ligation to cDNA or DNA 
fragments at its 3' end (adaptor-primer 2). As a marker which labels this adaptor-primer, those which are used in con- 
ventional analysis may be used. Specific examples include fluorescent dyes, radioactive materials and enzymes. 

These primers may be synthesized by using a commercial nucleic acid synthesizer. 

Potential target diseases which may be analyzed or diagnosed by Method I or Method II of the invention include 
malignant tumors such as brain tumor, stomach cancer, large intestine cancer, breast cancer, uterus cancer, skin can- 
cer, prostate cancer and malignant melanoma; virus infections such as herpes group infections, chronic hepatitis, 
cytomegalovirus infection and acquired immunodeficiency syndrome; and multifactorial hereditary diseases such as 
diabetes and hypertension. 

PREFERRED EMBODIMENTS OF THE INVENTION 

The present invention will be described in more detail below with reference to the following Reference Example and 
Examples, which are provided for the purpose of explanation and should not be construed as limiting the scope of the 
invention. 

[Reference Example] Preparation of cDNA 

(1) Purification of RNA by ultracentrifugation 

Mouse livers lyophilized in dry ice or liquid nitrogen were crushed with a homogenizer. To the crushed material, 5 
volumes of a GuCNS solution was added at room temperature and agitated with a vortex mixer. To a 10 ml polyallomer 
tube, 3.5 ml of 5.7 M CsCi/0.1 M EDTA solution was added and 6 ml of the resultant sample was layered over and then 
centrifuged overnight at 15°C at 32000 rpm using Beckman L70 centrifuge. 
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(2) Recovery of the RNA after ultracentrifugation 

The tube was removed from the rotor and all of the supernatant was discarded. The tube wall was wiped and dried. 
Thereafter, the precipitate was dissolved in 300 |il of TE buffer. 

(3) Ethanol precipitation 

To the aqueous layer, 1/10 volume of 3 M potassium acetate (pH 5.0) was added, mixed gently and placed in ice. 
Then, 2.5 volumes of ice-cooled ethanol was added to the above mixture and mixed gently The resultant mixture was 
left at -80°C for several hours and centrifuged at 4 °C for 5 minutes to precipitate RNA. The ethanol was discarded. The 
RNA precipitate was washed with ice-cooled 70% ethanol and re-centrifuged to precipitate RNA. After the ethanol was 
discarded, the RNA precipitate was dried. 

The above precipitate was dissolved in about 100 uJ of sterile distilled water per 1 g of the tissue cells to obtain an 
RNA solution (RNA concentration = approx. 5 ug/ul). 

(4) Preparation of cDNA template 

(4-1) Preparation of single-stranded cDNA molecules 

First, the resultant RNA and oligo-dT primers only were heated at 70 °C for 2-3 minutes. Then, other reagents were 
added thereto and kept at 37 °C for i hour to synthesize cDNA molecules. 



* Composition of the reaction solution 


5x Reverse transcriptase buffer (Gibco-BRL) 


4 uJ 


2mM dNTP (Pharmacia) 


4 uJ 


0.1MDTT 


2ul 


10 pmol/ul S'-amino (dT) 18 


1 nl 


Total RNA (3 ug) and distilled water 


7.5 ul 


RNase inhibitor ' 1) (40 u/ul) (Toyobo) 


0.5 til 


200 u/ul M-MLV Reverse transcriptase' 25 (Gibco-BRL) 


1 ul 


1 ) derived from human placentas 
2 > Molony Murine Leukemia Virus 



(4-2) Synthesis of double-stranded cDNA molecules 

The reaction solution described below was added to the single-stranded cDNA reaction solution and kept at 16 °C 
for 2 hours to thereby prepare double-stranded cDNA molecules. After the completion of the reaction, 3 \i\ of 0.25 M 
EDTA (pH 7.5) and 2 ul of 5 M NaCI were added thereto. Then, phenol extraction and ethanol precipitation were con- 
ducted and the precipitate was dissolved in 240 uJ of distilled water. 



* Composition of the reaction solution 


10 mM MgCI 2 


70 ul 


1 M Tris-CI (pH 7.5) 


10 ul 


1 M (NH 4 ) 2 S0 4 


1.5 ul 


RNase H (Toyobo) (1 u/ul) 


1.5 ul 


E. coli DNA polymerase I (Toyobo) (10 u/ul) 


4.5 ul 
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[Example 1] Analysis by the DNA molecular indexing method 



(1) Digestion with a class-IIS restriction enzyme (initial digestion) 

The cDNA prepared in Reference Example described above was digested with a restriction enzyme by keeping the 
cDNA in any one of the following reaction solutions (A) to (C) at a specified temperature under specified conditions. 



* Composition of the reaction solution (A) (using Fok I) 


10xM buffer 


10 nl 


0.1 % BSA (Takara Shuzo) 


10^1 


cDNA sample 


80 nl 


Fok I (Takara Shuzo) (10 u/^l) 


0.5 |il 


Kept at 37 °C for 50 minutes to 1 hour. 



* Composition of the reaction solution (B) (using Bsm Al) 


1 0x buffer for Bsm Al (NEB) 


10 Ml 


0.1 % BSA 


I0|il 


cDNA sample 


80 jul 


Bsm Al (NEB) (5 u/^l) 


M 


Kept at 55 °C for 50 minutes to 1 hour. 



35 


* Composition of the reaction solution (C) (using Bsm Fl) 




1 0x H buffer 


10 ^1 




Distilled water 


10^1 


40 


cDNA sample 


80 liI 




Bsm Fl (NEB) (5 u/\x\) 






Kept at 65 °C for 50 minutes to 1 hour. 



45 

After the completion of each of the reactions (i), (ii) and (iii) above, 3 \i\ of 0.25 M EDTA (pH 7.5) and 2 jut of 5 M 
NaCI were added to each reaction solution. Then, phenol extraction and ethanol precipitation were conducted and each 
precipitate was dissolved in 70 |J of distilled water. 

so (2) Addition of adaptors 

To the cDNA fragments obtained in (1) above, one of the following adaptors having the sequences described below: 
CI T adaptors: 

5'-B-GTACATATTGTCGTTAGAACGCT-3* 
55 S'-NXYZAGCGTTCTAACGACAATATGTAC-S' 
or 

C1G adaptors: 

5'-B-GTACATATTGTCGTTAGAACGCG-3' 
5'-NXYZCGCGTTCTAACGACAATATGTAC-3' 
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(wherein B represents biotin; N represents any of the four bases; and XYZ represents one of the 64 possible 
sequences. When YZ = AA, AT, TA OR TT C1G adaptor were used. Otherwise, C1T adaptors were used.) 
were added and kept in the following reaction solution at 16°C overnight, to thereby ligate the cDNA fragments to the 
adaptors. 



* Composition of the reaction solution 


10x E. coli DNA ligase buffer 


1 M* 


100 mM (NH 4 ) 2 S0 4 


1 M" 


1 pmol/^il adaptor solution 


1 M> 


cDNA sample digested with a class-IIS restriction enzyme 


1 Ml 


E. coli DNA ligase 


3 units 


Distilled water to make 10 |il 



(when the sequence XYZ did not contain G nor C, 5 pmol/^il adaptor solution and 30 units of E. coli DNA ligase 
20 were used.) 

(3) Digestion with class-US restriction enzymes (the second digestion) 

The cDNA fragments obtained in (2) above were further digested with class-IIS restriction enzymes by keeping the 
25 cDNA sample at a specified temperature under the conditions specified below: 

(i) When a Fok I digest was used: 

40 H of distilled water and 5 \x\ of 10x H buffer were added. 
Bsm Fl (1 unit) was added and kept at 65°C for 50 minutes. 
so Bsm Al (1 unit) was added and kept at 55°C for 50 minutes. 

(ii) When a Bsm Al digest was used: 

40 |il of distilled water and 5 j*l of 10x T buffer were added. 
Fok I (1 unit) was added and kept at 37°C for 50 minutes. 
Bsm Fl (1 unit) was added and kept at 65°C for 50 minutes. 
35 (iii) When a Bsm Fl digest was used: 

40 jJ of distilled water and 5 til of 10x M buffer were added, 

Fok I (1 unit) was added and kept at 37°C for 50 minutes. 

Bsm Al (1 unit) and 1 *il of 4 M NaCI were added and kept at 55°C for 50 minutes. 

40 (4) Amplification by PCR 

(4-1) Recovery of the adaptor molecules with paramagnetic beads 

Immediately before use, streptavidin-coated paramagnetic beads were washed twice with 0.1% BSA and once with 
45 1x B&W buffer (10 mM Tris-CI pH 7.5, 1 M NaCI, 1 mM EDTA) and then suspended in an equal volume of 1x B&W 
buffer. 

To each sample, 15 ul of 5 M NaCI and 5 |il of the paramagnetic beads were added, left stationary for 15 minutes 
and washed with 1x B&W buffer once. Then, 10 jul of 0.1 M NaOH was added thereto and left stationary for 5 minutes. 
Thereafter, the resultant mixture was washed with 50 ^l of 0.1 M NaOH once, with 1 x B&W buffer once and with distilled 
so water twice. 

(4-2) PCR reaction 

The reaction solutions having the compositions described below were placed in an Eppendorf tube and heated at 
55 96°C for 1 minute to allow a prompt initiation of reactions. Then, a thermal cycle consisting of 30 seconds at 94°C, 1 
minute at 50°C and 1 minute at 72°C was repeated 25 to 35 times. After an extension step was carried out at 72°C for 
20 minutes, the reaction solution was cooled to room temperature. 
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* Compositions of the reaction solutions (per 
(i) Enzyme reaction solution 


one sample) 


10x PCR buffer for Stoffei fragment 






2 mM dNTP 




1 n> 


25 mM MgCI 2 




1.2 mI 


Distilled water 




4.3 |il 


10 u/pl Stoffei fragment' 1 ) 




0.05 Ml 


(ii) Primer reaction solution 


10 pmol/nl fluorescent-CU 




0.5 mI 


10 pmol/pl d(T) 25 A [or d(T) 25 C, d(T) 25 G] 




2 mI 



' A portion of AmpliTaq DNA polymerase fragment (Perkin 
Elmer) 



H,-n ^ P U IP 6 combinations of JOE-C1T and d(T) 25 A; FAM-ClTand d(T) 25 C; and TAMRA-C1T and 

a(T) 25 G. [JOE: 2\7'-dimethoxy-4\5-dichloro-6-carboxyfluorescein, FAM: 5'-carboxyfluorescein, TAMRA' 6-carboxy- 
tetramethyl rhodamine (all manufactured by Perkin Elmer; the sequence of C1 T: d(GTACATATTGTCGTTAGAACGCT)] 
Alternatively, when C1G adaptors were used, the composition of the primer reaction solution is- 



10 pmol/ul fluorescent-ClG 



10 pmol/ul d(T) 25 A [ord(T) 25 C, d(T) 25 G] 



0.5 
Ml 

2 Ml 



The primers are used in the combinations of JOE-C1G and d(T) 25 A; FAM-C1G and d(T) 25 C; and TAMRA-C1G and 
d(T) 25 G. [JOE: 2\7'<fimethoxy-4\5-dichloro-6-carboxyfluorescein, FAM: S'-carboxyfluorescein. TAMRA 6-carboxy- 
tetramethyl rhodamine (all manufactured by Perkin Elmer; the sequence of C1G: d(GTACATATTGTCGTTA- 
GAACGCG)]. 

(4-3) Preparation of Electrophoresis Samples 

^-r/T, e f, h ° f the feaCt ' 0n P roducts - a sam P |e was teken as follows: 1 ul from the combination of FAM-C1 and 
d(T) 25 C, 3 ul from the combination of JOE-C1 and d(T) 25 A and 3 ul from the combination of TAMRA-C1 and d(T) 25 G 
To each sample, 5 ul of T4 DPase solution having the following composition was added and reacted at 37°C for 40 min- 
utes. 



* Composition of T4 DPase solution (per one sample) 


10xM buffer 


0.5 mI 


2 mM dNTP 


0.5 mI 


Distilled water 


4 mI 


T4 DNA polymerase (Toyobo) 


1 unit 



After ethanol precipitation of the reaction solution, 3.5 ul of a buffer (80% formaldehyde, 1 0 mM EDTA 6 mg/ml blue 
dextran) was added to the sample (i.e.. precipitate), heated at 95°C for 4 minutes, then immediately applied to the sam- 
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pie well of ABI 373A electrophoresis apparatus (Perkin Elmer) and run (at a constant electric power of 30W for 13 
hours). 

Fig. 4 shows one example of the electrophoresis patterns obtained. 

[Example 2] Analysis by the DNA molecular indexing method 

(1) Digestion with a class-ll restriction enzyme 

The cDNA prepared in Reference Example described above was digested with a restriction enzyme by keeping the 
cDNA in the following reaction solution at a specified temperature under specified conditions. 



* Composition of the reaction solution (using EcoRI) 


10*high salt buffer (attached to the enzyme) 


5 mI 


cDNA Sample 


45 nl 


EcoRI (Toyobo or Takara Shuzo) 


Sunits 


Kept at 37°Cfor 1 hour. 



After the completion of the reaction, phenol extraction and ethanol precipitation were carried out and the total pre- 
cipitate was used for the subsequent reaction. 

(2) Addition of adaptors 

To the cDNA fragments obtained in (1) above, the following adaptors 
5'-P- AATTCTTAACCAGGCTGAACTTGCTC-3' 
5'-OH-GAGCAAGTTCAGCCTGGTTAAG-3' 
were ligated by keeping the cDNA sample in the following reaction solution at 16°C for 16 hours or more. 



* Composition of the reaction solution 


10x ligation buffer (similar to Toyobos) 


2ul 


2.5 pmol/ul EcoRI adaptors 


2ul 


T4 DNA ligase 


150 units 


Total volume 


20 til 



After the completion of the reaction, phenol extraction and ethanol precipitation were carried out and the total pre- 
cipitate was used for the subsequent reaction. 

(3) Digestion with a class-IIS restriction enzyme 

The cDNA treated in (2) above was further digested with a restriction enzyme by keeping the cDNA sample in the 
following reaction solution at a specified temperature under specified conditions. 
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* Composition of the reaction solution (using Bsm Al) 


lOx buffer for Bsm AI(NEB) 


1Gul 


0.1% BSA 


10 nl 


cDNA sample 


80 til 


Bsm Al (NEB) (5u/yi\) 


0.5 |il 


Kept at 65°C for 50 minutes to 1 hour. 



After the completion of the reaction, phenol extraction and ethanol precipitation were carried out and the precipitate 
was dissolved in 30 ^il of purified water. 

(4) Addition of biotinylated adaptors 

To the cDNA fragments obtained in (3) above, the following adaptors: 

C1T adaptors: 5'-B-GTACATATTGTCGTTAGAACGCT-3' 

5'-OH-NXYZAGCGTTCTAACGACAATATGTAC-3' 
C1 G adaptors: 5'-B-GTACATATTGTCGTTAGAACGCG-3' 

5'-OH-NXYZCGCGTTCTAACGACAATATGTAC-3' 

(wherein B represents biotin; N represents any of the four bases; and XYZ represents one of the 64 possible 
sequences. When YZ = AT or TA, C1G sequences were used. Otherwise, C1T sequences were used.) 
were ligated by keeping the cDNA sample in the following reaction solution at 16°C overnight. 



* Composition of the reaction solution 


10xE. coli DNAIigase buffer 


M 


100 mM (NH 4 ) 2 S0 4 


M 


1 pmol/pl adaptor solution 


1 Ml 


cDNA fragments digested with a class- IIS restriction enzyme 


M 


E. coli DNAIigase 


3 units 


Distilled water 


to make 10 jlxI 



(when the sequence XYZ did not contain G nor C, 5 pmol/jal adaptor solution and 6 units of E. coli DNA ligase 
were used.) 

(5) Amplification by PCR 

(5-1) Recovery of the adaptor molecules with paramagnetic beads 

Immediately before use, streptavidin-coated paramagnetic beads were washed twice with 0.1% BSA and once with 
1x B&W buffer (10 mM Tris-CI pH 7.5, 1 M NaCI, 1 mM EDTA) and then suspended in an equal volume of 1x B&W 
buffer. 

To the sample, 1 5 ul of 5 M NaCI and 5 nl of the paramagnetic beads were added, left stationary for 1 5 minutes and 
washed with 1x B&W buffer once. Then, 10 yA of 0.1 M NaOH was added thereto and left stationary for 5 minutes. 
Thereafter, the resultant mixture was washed with 50 ul of 0. 1 M NaOH once, with 1 x B&W buffer once and with distilled 
water twice. 
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(5-2) PCR reaction 

The reaction solutions having the compositions described below were placed in an Eppendorf tube and heated at 
96°C for 1 minute to allow a prompt initiation of reactions. Then, a thermal cycle consisting of 30 seconds at 94°C , 1 
minute at 50°C and 1 minute at 72°C was repeated 25 to 35 times. After an extension step was carried out at 72°C for 
20 minutes, the reaction solution was cooled to room temperature. 



* Compositions of the reaction solutions (per one sample) 


(i) Enzyme reaction solution 


10x PCR buffer for Stoffel fragment 


1 ul 


2 mM dNTP 


1 ul 


25 mM MgCI 2 


1.2 mI 


Distilled water 


4.3 ul 


10 u/|il Stoffel fragment' 1 * 


0.05 ul 


(ii) Primer reaction solution 


10 pmol/ul fluorescent-CIS primer 


0.5 Ml 


10 pmol/ul k gt10 forward primer 


0.5 ul 



* 1 > A portion of AmpliTaq DNA polymerase fragment (Perkin 
Elmer) 



The two kinds of primers having the following sequences are used in combination: 
5 ' -OH -GTAC ATATTGTC GTTAG AAC G C -3'( C 1 S primer) 
5'-OH-GAGCAAGTTCAGCCTGGTTAAG"3'(^ gt10 forward primer) 

(5-3) Preparation of Electrophoresis Samples 

A 3 pi sample was taken from the reaction products and 5 ul of T4 DPase solution having the following composition 
was added thereto. The resultant mixture was reacted at 37°C for 40 minutes. 



* Composition of T4 DPase solution (per one sample) 


10xM buffer 


0.5 ul 


2 mM dNTP 


0.5 ul 


Distilled water 


4 ul 


T4 DNA polymerase (Toyobo) 


1 unit 



After ethanol precipitation of the reaction solution, 3.5 ul of abutter (80% formaldehyde, 10 mM EDTA, 6 mg/mi blue 
dextran) was added to the sample (i.e., precipitate), heated at 95°C for 4 minutes, then immediately applied to the sam- 
ple well of ABI 373 A electrophoresis apparatus (Perkin Elmer) and run (at a constant electric power of 30 W for 13 
hours). 

Fig. 5 shows one example of the electrophoresis patterns obtained. 
Claims 

1 . A method for molecular indexing comprising the following steps: 

(1) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA with a first restriction 
enzyme of class-IIS, 
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(2) ligating each of the resultant cDN A fragments to one from a pool of 64 biotinylated adaptors cohesive to all 
possible overhangs, 

(3) digesting the resultant cDNA fragments further with a second and a third restriction enzymes of class-IIS 
which are different from the first class-IIS restriction enzyme used in (1) above to thereby obtain a first cDNA 

5 sample, 

(4) obtaining a cDNA second sample by repeating the above steps (1) to (3) wherein the second class-IIS 
restriction enzyme is used for the initial digestion and the first and the third class-IIS restriction enzymes are 
used for the subsequent digestion, 

(5) obtaining a third cDNA sample by repeating the above steps (1) to (3) wherein the third class-IIS restriction 
10 enzyme is used for the initial digestion and the first and the second class-IIS restriction enzymes are used for 

the subsequent digestion, 

(6) recovering each of the resultant ligation samples by using streptavidin-coated paramagnetic beads and 
then removing from said samples the oligonucleotide complementary to an adaptor-primer to be used in (7), 

(7) amplifying each of the resultant cDNA samples by PCR using an adaptor-primer and one of anchored oligo- 
15 61 primers, 

(8) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes 
of the fragments obtained. 

2. The method of claim 1 , wherein the class-IIS restriction enzymes are used in the combination of Fok I, Bsm Ai and 
20 Bsm Fl. 

3. The method of claim i , wherein the cDNA has been synthesized by reverse-transcripting a tissue- or cell-derived 
RNA using a mixture of the following oligonucleotides as primers: 

5' OH-GGATCCT 16 A-3' 
25 5 T OH-CAGCTGT 16 C-3' 
5'0H-CTCGAGT 16 G-3' 

4. A method for molecular indexing comprising the following steps: 

30 (1 ) digesting cDNA which has been reverse-transcribed from tissue- or cell-derived RNA, or DNA with a restric- 

tion enzyme of class-ll, 

(2) ligating each of the resultant cDNA or DNA fragments to an adaptor cohesive to ends generated by the 
class-ll restriction enzyme, 

(3) digesting the resultant cDNA or DNA fragments further with a restriction enzyme of class-IIS, 

35 (4) ligating each of the resultant cDN A or DNA fragments to one from a pool of 64 biotinylated adaptors cohe- 

sive to all possible overhangs, 

(5) recovering the resultant ligation sample by using streptavidin-coated paramagnetic beads and then remov- 
ing from said sample the oligonucleotides complementary to adaptor-primers to be used in (6), 

(6) amplifying the resultant cDNA or DNA sample by PCR using adaptor-primers, 

40 (7) separating the amplified products by denaturing polyacrylamide gel electrophoresis and recording the sizes 

of the fragments obtained. 

5. The method of claim 4, wherein the class-IIS restriction enzyme is any one of Fok I, Bsm At, Bsm Fl, Sfa Nl or Bbv I. 



50 



55 
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FIG. 2 



When the base adjacent to poly A on the 5' side is T: 

" — TAi sGGATCC 
AT i sCCTAGG 

When the base adjacent to poly A on the 5' - side is G: 

" — GA i . CA6CTG 
CT, sGTCGAC 

When the base adjacent to poly a on the 5' side is C: 

CA, jCTCGAG 

— GT, .GAGCTC 
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FIG. 3 
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FIG. 4 
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H Lone I 8: 



Horizontal axis: Fragment size 
Vertical axis: fluorescence intensity 

Primers: (from the upper panel) FAM-C1T and d(T) 25 C; JOE-C1T 

and d(T) 25 A; TAMRA-C1T and d(T) 25 G 
Adaptor: NCCC 

Sample: Mouse liver cDNA (Fok I was used for the initial 
digestion. ) 
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FIG. 5 




■ Lane t 1; 



Horizontal axis: Fragment size 
Vertical axis: Fluorescence intensity 

Primer: TAMURA CIS 

Adaptor: NAAC for the upper panel, NCAG for the lower panel 
Sample: Mouse liver cDNA (digested with EcoRI and Bsm AI) 
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